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44. The method of any one of claims 174l , wherein binding of said test compound to 
said polypeptide is detected by the use of a^<5mpetition binding assay. 



\ 45. The method of an^dne of claims 37-41, wherein said test compound 

modulates the activity of said polypeptide. 

: f 

REMARKS 

Claims 1-26 were pending in the application. Claims 1-19 and 22-26 have been cancelled 
without prejudice, as being directed to a non-elected invention. Applicant reserves the right to 
traverse the restriction between the non-elected Groups in this or a separate application. Claims 

20 and 21 have also been cancelled and new claims 27-45 have been added. Accordingly, after 
the amendments presented herein have been entered, claims 27-45 will be pending. For the 
Examiner's convenience all of the pending claims are set forth in Appendix A. 

Support for the new claims can be found throughout the specification and the claims as 
originally filed. Specifically, support for claims 27-30 and 37-40 can be found in claims 20 and 

21 as originally filed. Support for new claims 3 1 and 41 can be found at page 5, lines 7-10 of the 
specification. Support for new claims 32-34 can be found at page 45, lines 7-17 of the 
specification. Support for new claim 35 can be found at page 45, lines 7-10 and page 27, lines 
30-32 of the specification. Support for new claim 42 can be found at page 48, line 32 through 
page 49, line 5 of the specification. Support for new claim 43 can be found at page 45, line 33 
through page 46, line 1 8 of the specification. Finally, support for new claim 44 can be found at 
page 46, lines 1 1-16 of the specification. 

Attached hereto is a marked-up version of the changes made to the specification and 
claims by the current amendments. The attached page is captioned "Version With Markings to 
Show Changes Made". 

At paragraph 3 of the present Office Action, the Examiner has requested that a paper 
copy of the sequence listing be submitted by the Applicant. Applicant respectfully submits that, 
pursuant to 37 C.F.R. § 1 .821 (e), the Examiner was authorized to use the paper and computer 
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readable forms of the Sequence Listing of the parent application Serial No: 09/439,165. 
Nonetheless, for the Examiner's convenience, a paper copy of the sequence listing is being 
submitted herewith. 

No new matter has been added. Any cancellation of the claims should in no way be 
construed as an acquiescence to any of the Examiner's rejections and was done solely to expedite 
the prosecution of the application. Applicant reserves the right to pursue the claims as originally 
filed in this or a separate application(s). 

Objection to Claims 20 and 21 

The Examiner has objected to claims 20 and 21 as being "of improper dependent form for 
failing to further limit the subject matter of a previous claim." 

Applicant has cancelled claims 20 and 21, thus, rendering this objection moot. In view of 
the foregoing, Applicant respectfully requests that the Examiner reconsider and withdraw this 
objection. 

Objections to the Disclosure 

The Examiner has objected to the numbering of Figure 1 because, "partial views of a 
drawing which are intended to form one complete view must be identified by the same number 
followed by a capital letter." 

Applicant respectfully submits that the Figures and the Brief Description of the Drawings 
have been amended according to the Examiner's request. A substitute copy of Figure 1, as 
amended, is submitted herewith. In view of the foregoing, Applicant respectfully requests that 
the Examiner reconsider and withdraw the foregoing objection to the disclosure. 

Rejection of Claims 20 and 21 Under 35 U.S.C. § 112, First Paragraph 

The Examiner has rejected claims 20 and 21 under 35 U.S.C. § 1 12, first paragraph 
because, according to the Examiner, "the specification, while being enabling for the practice of 
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a method of identifying a ligand which binds to a receptor protein comprising the amino acid 
sequence presented in SEQ ID NO:5 of the instant specification, does not reasonably provide 
enablement for the practice of a binding assay which employs a protein having anything less than 
the entire amino acid sequence presented in SEQ ID NOs:5, 1 1 or 20." {Emphasis added). 



In particular, the Examiner states that 

[b]ecause the instant specification does not identify those amino acid residues 
in the amino acid sequence of SEQ ID NO:5 which are essential for biological 
activity and structural integrity of a h VR-2 protein and those residues which 
are either expendable or substitutable, a practitioner can not make and use a 
polypeptide lacking that entire sequence with any reasonable expectation that 
the altered protein would respond in an authentic manner. In the absence of 
such structure-function information a practitioner would have to resort to a 
substantial amount of undue experimentation in the form of insertional, 
deletional and substitutional mutation analysis of over 760 amino acid 
residues before they could even begin to rationally design an hVR-2 
polypeptide having other than a natural amino acid sequence and which 
responds in a binding assay in a manner which is representative of the native 
protein. 

Applicant respectfully traverses the foregoing rejection for the following reasons. 

The pending claims are directed to methods for identifying a compound which binds to a 
polypeptide comprising the amino acid sequence of SEQ ID NOs:5 or 20 or a fragment of the 
amino acid sequence of SEQ ID NO:5 by :i) contacting the polypeptide, or a cell expressing the 
polypeptide with a test compound; and ii) determining whether the polypeptide binds to the test 
compound. 

With respect to the claimed methods that use the entire polypeptide of SEQ ID NOs:5 or 
20, Applicant respectfully submits that the Examiner has admitted that the specification is 
"enabling for the practice of a method of identifying a ligand which binds to a receptor protein 
comprising the amino acid sequence presented in SEQ ID NO:5" (see page 3 of the present 
Office Action). Accordingly, Applicant respectfully requests that the Examiner indicate as 
allowable claims 27-30 and 37-40, (and claims depending therefrom) which are directed to 
methods that use the entire polypeptide sequence of SEQ ID NOs:5 or 20. 

With respect to the claimed methods that use fragments of the polypeptide of SEQ ID 
NO:5, Applicant respectfully submits that based on the teachings in Applicant's specification, 
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and the knowledge generally available in the art at the time of filing, the skilled artisan would be 
able to practice the claimed methods using only routine experimentation. 

To begin with, Applicant has taught in the instant specification which regions of the VR- 
2 molecules are important for activity and, thus, which regions of the molecule would respond in 
a binding assay in a manner which is representative of the native protein. Specifically, as taught 
in the specification (at, for example, page 10, lines 1-35), the VR-2 polypeptide contains ankyrin 
repeats, transmembrane domains, and at least one proline rich domain, all characteristic and 
necessary for the function of the Capsaicin/Vanilloid family of receptors. Based on these 
teachings the skilled artisan would be equiped to use a fragment containing one or more of these 
domains to identify compounds that bind to a domain and modulate an activity of the 
Capsaicin/V anilloid receptor. Compounds identified in such a manner would be of practical 
value, e.g., they could be used as antagonists of a Capsaicin/Vanilloid receptor. For example, a 
fragment of a VR-2 polypeptide that contains the proline rich domain may be used by the skilled 
artisan to identify compounds capable of binding the proline rich domain and disrupting 
interactions with SH3 domain-containing proteins that act downstream of the VR-2 receptor in a 
pain signaling pathway. Such a compound would be expected to inhibit the function of the VR-2 
polypeptide by disrupting the VR-2 signaling pathway. 

As indicated by U.S. Patent Nos. 6,214,564 6,214,559, and 6,187,552 (copies of which 
are submitted herewith as Appendices B, C and D, respectively) fragments of proteins are 
typically used in screening assays, such as binding assays. In fact, the use of polypeptide 
fragments in such assays, instead of a full length protein molecule, may be preferable in some 
situations because of the ease in preparing and handling of the smaller polypeptide fragments. 
Screening assays utilizing polypeptide fragments are routinely used in the art and have resulted 
in the identification of a plethora of useful compounds (including a number of commercially 
available drugs). 

Moreover, Applicant has taught how such fragments would be generated and used in 
binding assays. For example, at page 27, lines 5-28 of the specification, Applicant teaches 
methods for generating and screening fragments of VR-2 receptors. Specifically, Applicant 
teaches methods to generate a library of VR-2 fragments by creating double stranded DNA 
fragments that encode N-terminal, C-terminal and internal fragments of various sizes. Applicant 
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further teaches how a skilled artisan would insert these fragments into an expression vector and 
how one would assay the resulting VR-2 fragments. Based on these teachings in Applicant's 
, specification the skilled artisan would be able to make and use fragments of the VR-2 receptor. 

The Examiner further states that 



because the differences between the amino acid sequence of the 
hVR- 2 protein of the instant invention and that of hVR-1 are 
greater than the similarities, one would not conclude that these two 
proteins bind the same spectrum of ligands or modulate the same 
cellular processes. 

Applicant respectfully traverses this rejection for the following reasons. Applicant has 
shown a sequence alignment of human VR-1 (SEQ ID NO:2) and human VR-2 (SEQ ID NO:5) 
that shows amino acid identity between the two polypeptides. Both polypeptides are 
characterized by the same biologically active domains, e.g., ankyrin repeats, transmembrane 
domains, and at least one proline rich domain, and both polypeptides are expressed in the same 
type of trigeminal sensory neurons known to be involved in nociception. Applicant has also 
taught in the specification that the claimed VR-2 molecules are part of a calcium regulation 
system involved in pain response. Based on these teachings an ordinary skilled artisan would 
conclude that the claimed receptor is, indeed, a capsaicin/vanilliod receptor belonging in the 
same family as the hVR-1 receptor. 

The Examiner is also of the opinion that 

the instant specification does not provide a method through which 
an artisan can determine if a derivative of the disclosed protein 
retains functionality because the instant specification discloses 
neither a demonstrated ligand for the instant receptor or the 
pathway through which it has been shown to signal. A receptor, by 
definition, must bind a ligand and transduce a signal. To determine 
if a derivative of a receptor has retained its function an artisan must 
be able to measure both of these activities. Since the instant 
specification does not identify a ligand which has been 
demonstrated to bind to the disclosed protein, it is not possible to 
determine if ligand-binding is retained. 
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Contrary to the Examiner's assertions the identity of the natural ligand of the VR-2 
receptor proteins described in the specification is not requisite to the enablement of the claimed 
nucleic acid molecules. For example, Silverman et al (1998) and U.S. Patent No. 6,1 10,693, 
(provided herewith as Appendices E and F, respectively), describe methods of assaying GPCR 
activity without knowledge of the receptor ligand. Modulators of VR-2 activity can be identified 
by one of skill in the art without knowledge of the natural ligand through methods known in the 
art, such as, for example, the methods described in the references cited above, e.g., the high- 
throughput cell-based methods described in Silverman et al. 

The Examiner states that the claims 

encompass an assay which requires a practitioner to measure VR -2 
activity in response to the binding of a ligand to that receptor. To practice 
the claimed assay one must be able to measure a signal which is 
transduced by the ligand activation of an hVR-2 protein and the 
instant application does not identify a specific cellular signal which 
has been shown to be modulated by the binding of a ligand to a 
receptor of the instant invention 

Applicant respectfully traverses this rejection for the following reasons. Contrary to the 
Examiner's assertions, Applicant has taught multiple methods for measuring binding of a ligand 
to a VR-2 receptor. For example, Applicant has taught methods for measuring changes in 
intracellular calcium levels and membrane depolarization, signals which are modulated by the 
binding of a ligand to a VR-2 receptor (see, for example, page 45, lines 10-14 or page 27, lines 
29-37 of the specification). Applicant further teaches methods to measure binding of a ligand to 
a VR-2 receptor in vitro. At page 45, line 18 through page 48, line 16, Applicant teaches 
methods for measuring the interaction of a test compound with VR-2 receptor, e.g., coupling the 
VR-2 substrate with an enzymatic or radioisotopic label, measuring the cytosensor, and 
measuring association and disassociation using surface plasmon resonance. Based on these 
teachings in the Applicant's specification an ordinary skilled artisan would be able to measure 
the binding of a ligand to the claimed VR-2 receptors {e.g., by measuring a signal that is 
generated by binding of a ligand to the receptor). 
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In view of the foregoing teachings in Applicant's specification and the knowledge 
generally available to the skilled artisan at the time of the invention, the skilled artisan would be 
able to practice the claimed invention using only routine experimentation. Accordingly, 
Applicant respectfully requests that the Examiner reconsider and withdraw the foregoing 
rejection. 

Rejection of Claims 20 and 21 Under 35 U.S.C. § 112, First Paragraph 

The Examiner has rejected claims 20 and 21 under 35 U.S.C. § 112, first paragraph 
because, according to the Examiner, these claims contain "subject matter which was not 
described in the specification in such a way as to reasonably convey to one skilled in the relevant 
art that the inventors, at the time the application was filed, had possession of the claimed 
invention." Specifically the Examiner is of the opinion that "[t]here is absolutely no written 
description in the instant specification of an allelic variant of a protein comprising the amino acid 
sequence presented in SEQ ID NO:5 of the instant application." 

In the interest of expediting prosecution, all claims drawn to allelic variants have been 
cancelled, thus, rendering this rejection moot. Accordingly, Applicant respectfully requests that 
the Examiner reconsider and withdraw the foregoing rejection. 

The cancellation of these claims should in no way be construed as an acquiescence to 
any of the Examiner's rejections and was done solely to expedite the prosecution of the 
application. Applicant reserves the right to pursue the claims as originally filed in this or a 
separate application(s). 
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Rejection of Claims 20 and 21 Under 35 U.S.C. § 102(a) as Being Anticipated 

by Julius et al and Davis et al. 

The Examiner has rejected claims 20 and 21 as being anticipated by Julius et al (WO 
99/37675) and by Davis et al (WO 99/37765). The Examiner states that SEQ ID NO:5 of the 
instant application is identical to SEQ ID NO:36 of Julius et al and SEQ ID NO:2 of Davis et al 

Applicant respectfully traverses this rejection for the following reason. The instant 
application claims and is entitled to priority to U.S. Patent Application Serial No: : 09/258,633 
filed on February 26, 1999. Julius et al (WO 99/37675) and by Davis et al (WO 99/37765) 
both published on July 29, 1999. Accordingly, Applicant respectfully submits that the Julius et 
al and Davis et al references, which have an effective prior art date after the effective filing 
date of the pending application, are not available as prior art against the present application 
and, thus, respectfully requests that the Examiner reconsider and withdraw this rejection. 

Notwithstanding that Julius et al and Davis et al are not available as prior art, Applicant 
respectfully notes that the cited sequences are not identical to those of SEQ ID NO: 5 of the 
instant application. A sequence alignment of SEQ ID NO:36 of Julius et al and SEQ ID NO:5 
of the instant application shows that there is a single amino acid insertion at residue 420 of SEQ 
ID NO:5 that is not present in SEQ ID NO:36. A sequence alignment of SEQ ID NO:2 of 
Davis et al and SEQ ID NO:5 of the instant application shows multiple mismatches in the 
amino acid sequence between residues 310 and 565 of SEQ ID NO:5. For the Examiner's 
convenience a copy of both sequence alignments are provided herewith as Appendices G and H, 
respectively. 

A sequence alignment of SEQ ID NO:36 of Julius et al and SEQ ID NO:20 of the 
instant application shows multiple gaps and mismatches in the amino acid sequence beginning 
at residue 450 of SEQ ID NO:20. A sequence alignment of SEQ ID NO:2 of Davis et al and 
SEQ ID NO:20 of the instant application shows an insertion of approximately 130 amino acid 
residues beginning at 533 of SEQ ID NO:20. For the Examiner's convenience a copy of both 
sequence alignments are provided herewith as Appendices I and J, respectively 
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SUMMARY 

If a telephone conversation with Applicant's attorney would expedite the prosecution of 
the above-identified application, the Examiner is urged to call the undersigned at (617) 227- 
7400. 



Respectfully submitted, 
LAHIVE & COCKFIELD, LLP 




Attorney for Applicant 

Limited Recognition Under 37 CFR § 10.9(b) 

28 State Street 
Boston, MA 02109 
Telephone: (617) 227-7400 
Facsimile: (617)742-4214 

i 

Dated: January 3 1 , 2002 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 

Figures 1 A-D depicts the full length cDNA sequence and predicted amino acid sequence 
of human VR-1 (hVR-1). The nucleotide sequence corresponds to nucleic acids 1 to 3909 of 
SEQ ID NO: 1 . The amino acid sequence corresponds to amino acids 1 to 839 of SEQ ID NO:2. 
The coding region without the 5' and 3* untranslated regions of the human VR-1 (hVR-1) gene is 
shown in SEQ ID NO:3. 
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Appendix A 

27. A method for identifying a compound which binds to a polypeptide comprising 
the amino acid sequence of SEQ ID NO:5, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid 
sequence of SEQ ID NO:5. 

28. A method for identifying a compound which binds to a polypeptide comprising 
the amino acid sequence of SEQ ID NO:20, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid 
sequence of SEQ ID NO:20. 

29. A method for identifying a compound which binds to a polypeptide consisting of 
the amino acid sequence of SEQ ID NO: 5, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide consisting of the amino acid 
sequence of SEQ ID NO:5. 

30. A method for identifying a compound which binds to a polypeptide consisting of 
the amino acid sequence of SEQ ID NO:20, the method comprising: 



Serial Number: 09/587,11 



- 17- 



Group Art Unit: 1646 



a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide consisting of the amino acid 
sequence of SEQ ID NO:20. 

31. A method for identifying a compound which binds to a polypeptide comprising at 
least 15 contiguous amino acids of SEQ ID NO:5, the method comprising: 

a) contacting a cell expressing the polypeptide with a test compound under 
conditions suitable binding; and 

b) determining whether the test compound binds to said polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising at least 15 contiguous 
amino acids of SEQ ID NO:5. 

32. The method of any one of claims 27-3 1 , wherein binding of the test compound to 
the polypeptide is detected by the use of an assay for a hVR-2 activity. 

33. The method of claim 32, wherein said hVR-2 activity is modulation of membrane 
depolarization. 

34. The method of claim 32, wherein said hVR-2 activity is modulation of 
intracellular calcium levels. 

35. The method of any one of claims 27-31, wherein said cell expressing said 
polypeptide is a neuronal cell. 

36. The method of any one of claims 27-3 1 , wherein said compound modulates the 
activity of said polypeptide. 
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37. A method for identifying a compound which binds to a polypeptide comprising 
the amino acid sequence of SEQ ID NO:5, the method comprising: 

a) contacting a sample comprising the polypeptide with a test compound 
under conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid 
sequence of SEQ ID NO:5. 



38. A method for identifying a compound which binds to a polypeptide comprising 
the amino acid sequence of SEQ ID NO: 20, the method comprising: 

a) contacting a sample comprising the polypeptide with a test compound 
under conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising the amino acid 
sequence of SEQ ID NO:20. 



39. A method for identifying a compound which binds to a polypeptide consisting of 
the amino acid sequence of SEQ ID NO:5, the method comprising: 

. : a) contacting a sample comprising the polypeptide with a test compound 
under conditions suitable for binding; and 

b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide consisting of the amino acid 
sequence of SEQ ID NO:5. 



40. A method for identifying a compound which binds to a polypeptide consisting of 
the amino acid sequence of SEQ ID NO:20, the method comprising: 

a) contacting a sample comprising the polypeptide with a test compound 
under conditions suitable for binding; and 
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b) determining whether the test compound binds to the polypeptide, thereby 
identifying a compound which binds to a polypeptide consisting of the amino acid 
sequence of SEQ ID NO:20. 

41 . A method for identifying a compound which binds to a polypeptide comprising at 
least 15 contiguous amino acids of SEQ ID NO:5, the method comprising: 

a) contacting a sample comprising the polypeptide with a test compound 
under conditions suitable for binding; and 

b) determining whether the test compound binds to said polypeptide, thereby 
identifying a compound which binds to a polypeptide comprising at least 15 contiguous 
amino acids of SEQ ID NO:5. 

42. The method of any one of claims 27-3 1 or 37-41, wherein binding of said test 
compound to said polypeptide is detected by the use of a yeast two-hybrid assay. 

43. The method of any one of claims 37-41 , wherein binding of said test compound to 
said polypeptide is detected by the use of a direct binding assay. 

44. The method of any one of claims 37-41, wherein binding of said test compound to 
said polypeptide is detected by the use of a competition binding assay. 

45. The method of any one of claims 37-41, wherein said test compound 
modulates the activity of said polypeptide. 
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^pendix E | 

New assay technologies for high-throughput screening 

Uuren Silverman, Robert Campbell and James R Broach* 



The use of high-throughput screening for ©arty stage drug 
discovery imposes several constraints on the format of assays 
for therapeutic targets of interest Homogeneous cellared 
assays based on energy transfer, fluorescence polarization 
spectroscopy or fluorescence correlation spectroscopy 
provide the sensitivity, ease, speed and resistance to 
interference from test compounds needed to function to a 
high-throughput screening mode. Similarly, novel celt-based 
assays are now being adapted for high-throughput screening, 
providing for in situ analysis of a variety of biological targets. 
Rnally, recent advances in assay miniaturization mark a 
transition to uKra high-throughput screening, ensuring that 
•dentmcation of lead compounds will not be the rate-limiting 
Step in finding new drugs. 
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Abbreviations 

CftE cAMP response element 

FCS fluorescenca correlation spectroscopy 

OFP green fluorescent protein 

HTS hlgh-throuohput screening 



Introduction 

Continuing advances in molecular biology, human genetics 
and genomics have accelerated identification of the mech- 
anisms underlying a growing number of human diseases. 
This progress has increased the number of novel protein 
targets available for potential therapeutic intervention 
by drug treatment Concurrently, novel approaches in 
combinatorial chemistry and expanded collections of 
natural products have dramatically increased the number 
of compounds that can be tested for activity against these 
targets. The confluence of these two trends towards more 
potential targets and larger chemical libraries has greatly 
stimulated adoption of high-throughput screening (HTS) 
as the primary cool for early stage drug discovery. 

HTS is die process by which large numbers of compounds 
are tested, in an automated fashion, for activity as 
inhibitors or activatora of a particular biological target, such 
as i cell surface receptor or a metabolic enzyme, Although 
any assay performed on the bench top can, in theory, 
be applied in HTS, conversion to an automated format 
imposes certain constraina that affect the design of the 
essay in practice. Procedures that arc routine at the bench 



are often extremely difficult to automate. Also, the more 
steps required for an assay, the more difficult to automate 
the HTS- The ideal assay is one that can be performed in a 
single well with no other manipulation other than addition 
of the sample to be tested. 

A number of assay formats have been developed or 
modified over the past few years to conform to the 
constraints imposed by HTS. These assay protocols can 
be divided into two groups: cell-free assays that measure 
the biological activity of a relatively pure protein target and 
cell-based assays that assess the activity of a target, protein 
by monitoring a biological response of a cell in which the 
target protein resides. In either case, the protocols require 
minimal manipulations* can be performed robotically in 
relatively small volumes, yield robust responses and are 
relatively impervious to perturbation by solvents and 
compounds used in drug screening. In this review wc 
describe several of the more recently developed or 
exploited assay protocols for HTS. 

Cell-free assays 

The primary goal in adapting cell-free assays to HTS is 
to minimize the number of steps required in setting up 
the assay and in detecting the activity; be it an enzymatic 
reaction or the binding of two components. This goal has 
been met to a large extent by development of detection 
systems that do not require separation of the product of the 
reaction from substrate, or from other components of the 
assay mixture. Earlier approaches to such homogeneous 
assay formats relied on proximity-dependent energy 
transfer. The output of such assays derived from the 
signal enhancement generated by bringing a source and a 
distance-dependent amplifier close together. For example, 
the fJ-particles of a low-energy radionuclide attached to 
a ligand will stimulate the fluorescent emission of a 
scintfllant in a bead to which the UganxTs receptor is 
attached (1,2]. More recently, this detection method has 
been applied to enzymatic reactions, such as that catalyzed 
by topoisomemsc I (3]. As another example of energy 
transfer assay formats, the rare earth metal lanthanide, 
EuZ+ 9 when irradiated by light, can transfer its excitation 
energy in a nonradiativc process to the fluorescent protein, 
allaphycoeyanin, if the two are in close proximity: This 
can occur when a Eu**-derivitized ligand binds to an 
allopbycocyantn-Unkcd receptor &5] or a EuZ*-dertvitized 
anti-phospho tyrosine antibody bends to a detccox-lsnkcd 
ptosphorylaced subsxraoe of a tyrosine kinase such as arc 
[6*]. Use of time resolved fluorescent procedures assessing 
emission at specific rimes following excitation e nh a n c es 
the sensitivity of this technique by reducing interference 
from background fluorescence, from test compounds or 
from assay components (6*,7*]* Finally enzymatic assays 
suitable for HTS and based on fluoresc en t resonant energy 
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transfer between two different forms of green fluorescein 
protein (GFp) have recently been described (8 # J. 

A number of investigators have exploited fluorescence po- 
larization spectroscopy (FPS) as the basis for homogeneous 
HTS assays of both enzymatic and binding reactions. 
When fluorescent molecules in solution arc excited with 
polarized light, the degree to which the emitted light 
retains polarization depends on the extent to which the 
fluorescent molecule rotates during the interval between 
excitation and emission. The rapid rotation of small 
fluorescent molecules in solution results in substantial 
loss of polarization. If such small molecules bind to 
larger molecules, their rotational diffusion is reduced and 
the retention of polarization is correspondingly increased. 
Thus, by measuring the relative intensity of emitted light 
in the planes normal and orthogonal to the plane of tie 
incident polarized light, the extent of rotation of a target 
molecule, and inferentially, the extent of binding of the 
target molecule to a larger component, can be calculated. 
For instance, fluorescent polarization has been used to 
detect the presence of specific drugs or hormones (9,10], to 
assess antibody binding of Auorcsocin-conjugated peptides 
llll or to monitor DNA; DNA hybrid formation (12]. The 
recent availability of a %-well plate reader [13] with a 
high sensitivity to fluorescein and fluorescein conjugates 
has allowed development of 96- well based fluorescent 
polarization assays. Such high-throughput assays for sre 
family tyrosine kinase activity 1 14*1, for binding of 
phosphopeprides to Src SH2 domains (1S"] # for interaction 
between STATt and an y-incerferon receptor-derived 
phusphotyrosinc-containing peptide (16'] and for specific 
protease activities 117,18*1 have recently been described. 
The sensitivity of fluorescence polarization, the ease and 
speed with which such assays can be run and the resistance 
of such assays to interference from absorptive compounds 
commonly present in complex mixtures (18*1 make this 
procedure highly amenable to HTS. 

Fluorescence correlation spectroscopy (FCS) represents 
another recently developed detection format eminently 
suitable for HTS. FCS measures differences in physical 
states of a target molecule, such as bound versus free 
or cleaved versus intact, in a homogeneous mixture 
ll°| Specifically, FCS measures the burst of fluorescent 
emission of a molecule passing through a small volume of 
space, which is defined by a sharply focused laser beam. 
Small molecules diffuse through the volume rapidly and 
thus yield short burses of light. Binding of these small 
molecules to larger molecules reduces their rxanslatiooal 
diffusion and correspondingly increases the duration of the 
hursts of light. Dcconvolution of the emission patterns 
in a sample by appropriate software can yield the 
relative amount of the bound and unbound states of a 
fluorcscendy tagged bgand. This technology can therefore 
readily fee applied to measure rcccpcar-Ugand tatenftcdons, 
DNA-protein interactions, nucleic add hybrid formation 
and certain cntymatic reactions (20]. 



Cell-based assays 

Cell-based assays arc an increasingly attractive alternative 
to i* vim biochemical assays for HTS. Such im vivo assays 
require an ability to examine a specific cellular process 
and a means to measure its output. For instance, agonist 
activation of a cell surface receptor or a ligand-gated 
ion channel can elicit a change in the transcription 
pattern of a number of genes. This I i^and -induced 
alteration in transcription can be readily captured by using 
gene fusions, in which a promoter clement responsive: 
to receptor activation is fused to the coding region 
for an enzyme or protein whose levels can be easily 
measured. Appreciation of the particular signaling pathway 
• associated with a specific receptor allows Identification of 
the appropriate transcriptional response clement required 
to detect a response. Figure 1 depicts a number f 
signal transduction pathways, indicating the transcriptional 
response elements coupled to each pathway. Several 
reporter genes that generate products that can be adapted 
to HTS format are available [21,22]. These arc listed in 
Table 1, with references to recent innovations in their use 
123\24,25,26']. For instance, the recent report of novel 
fluorescent, cell-perrneablc substrates for (^lactamase 
documents the use of ^lactamase to detect teceptor 
activation in single cells, making it an attractive assay 
system for high density HTS ]27**1. 

While cell-based assays using reporter genes have proved 
effective as an HTS format, detecting more immediate 
responses to target protein activation provides several ad- 
vantages, including shorter duration of the assay and fewer 
false positives from nonspecific interactions. As indicated 
in Figure 1, such cellular response dependent on activation 
of a receptor include elevation of a second messenger (fpf 
exaraole. Ca 2 *. cAMR inositol triuhosohatc), phosphory- 
lation of an intermediate signaling orotein, or subcellular 
\x aiuluuuioTT of 'arsnrhattne molecule. Recent advances in 
molecular tnoTogy and in instrumentation nave made it 
possible to monitor these events inac automated format. 
For instance, the recent availability of a 96-well fluorescent 
imaging plate reader (Molecular Devices, Sunnyvale, 
California, USA) permits HTS of receptor activation by 
monitoring Ca*+ mobilization of cells preloaded with a 
fluorescent calcium indicator, such as FLUQ4 (Molecular 
Probes, Eugene, Oregon, USA). In addition, recombinant 
cells expressing a calcium-sensitive fluorescent protein, 
such as aequorin (28 s ] or a hybrid calmodulin-GFP protein 
(29**], obviate the need for preloading cells with dyes 
in order to detect calcium fluxes following stimulation. 
A separate approach to detecting carry events following 
receptor stimulation involves examining rclocaUzation of 
specific components of the signal transduction machinery. 
For instance, MAP kinase (Figure 1) relocaltxes from 
the cytoplasm to the nucleus within minutes following 
sti mu la ti o n of on upstream (>pracin -coupled receptor 
(30,31). Similarly, Barak H */« J32*l have shown chat 
recruitment 

of a. flrirrrsfiTv-OEP. fusion to (he 

plasma membrane can be used eo monitor ■crivatioa 
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_ a number of different C^BtcifeCQiiQlQi, 
Rct-edt advances in microscopic imaging technology, ia 
conjunction with sofcwaie pcxmicdxig automated image 
recognition, provide a means to capture these events in 
a hi^Khtougfrput mode. 



Cell-based assays have significant advang^s jmxf im dire 
assays. Fuse the starting material (the cctl) serf-fcoEcatc*, 
aveftutg the investment involved in iucpajJne* purifed 
taigct, in cnemtcalty modn^ui* the sugx to .suit she 
screen* and so oil. Second, the targets and readouts axe cx- 
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Table 1 

Reporter genes useful lor catt-bas«d Hfett-cftuotirjhput scroocKrtg. 



gepeS ts0uf °^ AdvanUQes Disadvantages Retoervcee 

P-oalactosidase Well chaiacteroad; stable, inaxpoosive Endogenous activity (mammalian (23».501 

(bacterial) substrates; highly sensitive fkjoreacem calls); tatrameric (nonlinear 

or chem&iminesceftt substrates avofebla; response at low concentration) 

little irrterferonce from test compounds, 

simply readouts (readily automated) 

Ludferase Dimoric; high specific activity; no Requires addition of cefaclor 123*1 

(firefly) endogenous activity (Sow background) Cucaferini and presence of 0 2 

and ATP 

Alkaline Secreted protein (avoids tho need for Endogenous activity in some cell (24,25] 

phosphatase rnen*iran*-penneabte substrates); types; optimal at pH9.8 

(human placental) inexpensive cotorimetric and highly 

sensitive luminescent assays avaiable 

lactamase Monomelic; highly sensitive Marribrane-perrfieabto 127~I 

(bacterial) fluorogenic substrates described; fluorescent substrates not readily 

_ no endogenous activity ave3able 

GFP (jellyfish) Monomoric; no substrate needed tno Relatively tow spocrfic activity (26*51,52] 

rnantputationa required f or assay); no 
endogenous activity; multiple forms 
available 



amincd in a biological context that more faithfully mimics 
the normal physiological situation. Third, cell-based assays 
can provide insights into bioavailability and cytotoxicity. 
Mammalian cells arc expensive to culture and difficult 
to propagate in the automated systems used for HTS, 
however. 

An alternative CO mammalian cell based assays is to 
recapitulate the desired human physiological process in a 
micro-organism such as yeast (33J, For instance, signaling 
via human G-protcin-couplcd receptors has been reconsti- 
tuted in yeast to yield a facile growth response or a reporter 
gene readout ((34451; Klein it «/♦. unpublished data). 
Similarly, mammalian ion channels have been coupled 
to growth response in yeast (36J. Also, protein-protein 
interactions, including RA&-RAF association (371 and 
tyrosine kinase rcceptor-ligand binding 138], have been 
faithfully reproduced using the yeast two-hybrid system. 
Finally, many mammalian transcription factors operate in 
yeast, including glucocorticoid receptor (39,40] and the 
retinok acid receptor and retinoid X receptor families of 
receptors 141]. The ease and low cost of growing yeast, 
their ready genetic manipulation, and their resistance to 
solvents make yeast mn attractive option for cdl-bascd 
HTS. 

Miniaturization 

Several factors arc fueling efforts to increase the speed 
of HTS and decrease the volume of individual reactions 
within an HTS format. Split-bead synthesis (see Note 
added in proof), or other similar approaches to combi- 
natorial chemistry, dramatically increases the number of 
compounds that can be produced in a library but do so at 
the cost of quantity of material In addition, the limited 
supply of existing compounds within chemical libraries 



of pharmaceutical companies, and the growing number 
of targets against which such compounds can be tested, 
motivate a frugal approach to use of those compounds. 
Finally, the reagent costs associated "with HTS, when 
multiplied by the increasing number of assays per run, arc 
becoming a significant cost of early stage drug discovery. 

In response to these exigencies, a number of groups 
have begun to develop formats for very high density 
screening using very small assay volumes. One approach 
involves reducing the well size and increasing the density 
of the assay plate but retaining the overall assay format 
used in current %-wcll based HTS. Densities of 6S00 
assays in a 10 cm array have been reported for cell -free 
enzyme based assays (42*] and for ligand binding in 
cell based assays {43**]. This approach of miniaturizing 
existing formats significantly increases the number of 
assays per plate and the overall throughput of the screen 
but is intrinsically limited by the physical constraints 
of delivering small volumes to wells, and of detecting 
responses in a sensitive and timely manner. Accordingly, 
novel formats have been developed that eschew the 
assay format based on welts* One approach uses glass 
chips containing rnicrocharinels in which reagents, target 
proteins and compounds are herded by clectrokinetic flow 
controlled by electric potentials applied at the ends of the 
channels (44«J, A related approach attains high-throughput 
both of chemical synthesis and activity assessment by 
parallel arrays of truec^imenstonal channels in which 
flow is controlled by miniature hydrostatic actuators [AS]. 
These approaches provide significant reduction in the 
volume of assays and a corresponding savings in reagent 
costs over conventional HTS (45]. In addition, with 
further development in parallel processing in multiple 
chips, the number of assays performed in a given period 
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of time can increase dramatically. This movement co 
miniaturization is likely to ensure that the initial stage of 
drug discovery identification of lead compounds will not 
be the race-limiting step in finding new drufcs. 

Conclusions 

The last decade has witnessed the emergence across the 
pharmaceutical industry of the 96-wdl-bascd, robotics- 
driven, hi^h-chroufchput scrccnin K process as the primary 
tool for identifying active compounds in the first stage 
of drug discovery. This program has dictated the format 
of the assays that arc used to assess che activities of tar- 
gets—enzymes, receptors, transporters and so on — that 
underlie drug discovery in various therapeutic areas. 
A number of such formats — resonant energy transfer 
and fluorescent polarization spectroscopy in cell-based 
assays — have gained widespread acceptance and growing 
incorporation into high-throughput screening programs. 
The growing number of potential therapeutic targets, 
the increasing number of screcniblc compounds, the 
accelerating costs of screening and the increasing pressure 
to generate more lead compounds in a shorter time alt 
conspire to render even die new approaches inadequate for 
meeting die anticipated throughput requirements, how- 
ever. Thus, we we likely to witness a movement towards 
even greater screening throughput by miniaturization and 
increased reliance on robotics. Whether a new standard 
format for screening emerges in the near future, or whether 
a variety of formats are pursued concurrendy remains 
to be seen. Nonetheless, we can anticipate that the 
exigencies of drug screening will motivate a continued 
application of state-of-the-art technologies to the process 
of high-throughpuc screening. 

N te added in proof 

For a reference describing split-bead synthesis, sec (S3]. 
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Appendix G 

lalign output for SEQ ID NO:5 vs. SEQ ID NO: 36 



[ISREC-Server] Date: Mon Jan 14 20:59:50 MET 2002 



reset matrix file to /export/molbio/share/fasta2/blosum80.mat ./wwwtmp/lalign/.24621.1.seq : 764 aa 

ALIGN calculates a global alignment of two sequences 

version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 4:11-17 
SEQ ID NO: 5 764 aa vs. 

SEQ ID NO: 36 725 aa 

scoring matrix: /export /molbio/share/f asta2/blosum80 .mat , gap penalties: -14/-4 
88.3% identity; Global alignment score: 34 93 

10 20 30 40 50 60 

. /wwwt MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 



SEQ MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 
10 20 30 40 50 60 



70 80 90 100 110 120 

. /wwwt RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 



SEQ RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 
70 80 90 100 110 120 

130 140 150 160 170 180 

. /wwwt MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 



SEQ MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 
130 140 150 160 170 180 

190 200 210 220 230 240 

. /wwwt LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA 



SEQ LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA 
190 200 210 220 230 240 

250 260 270 280 290 300 

. /wwwt TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 



SEQ TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 
250 260 270 280 290 300 

310 320 330 340 350 

. /wwwt AAKEGKIEIFRHILQR-EFSGLSH-LSRKFTEW-CYGPVRVSLYDLASVDSCEENSVLEI 

SEQ AAKEGKIGI FRHILASGKFSGLKPPFPRKFTEWWLMGPVRVSLYDLASVDSCEENSVLGI 
310 320 330 340 350 360 

360 370 380 390 400 410 

. /wwwt IAFHCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKK 



SEQ IAFHCKSPDRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKK 
370 380 390 400 410 420 

420 430 440 450 460 470 

. /wwwt QAAPHLKAEVGNSMLLTGHILILLGGI YLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQ 
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lalign output for SEQ ID NO:5 vs. SEQ ID NO: 36 http://www.ch.embnet.org/cgi-bin/LALIGN_form_parser 

SEQ QAAPHLKAEVGNSMLLTGHILILLGGIYLLVGQKWKFW IWI-FIDSYFEILFLFP 

430 440 450 460 470 



480 490 500 510 520 530 

./wwwt ALLTVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHTGI YSVMIQKVILRDLLR 

SEQ GH-RVVPAPACVCA GAGLAE PALL YTWL PAHRHLQCHD 

480 490 500 510 

540 550 560 570 580 590 

. /wwwt FLLIYLVFLFGFAVALVSLSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEAS 



SEQ PEALVSLSQD-WRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEAS 

520 530 540 550 

600 610 620 630 640 650 

. /wwwt LELFKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATDSW 



SEQ LELFKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATDSW 
560 570 580 590 600 610 

660 670 680 690 700 710 

. /wwwt SIWKLQPCAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQT 



SEQ SIWKLQKAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQT 
620 630 640 650 660 670 

720 730 740 750 760 

. /wwwt LPTLCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 



SEQ LPTLCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 
680 690 700 710 720 
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l Giitput for SEQ ID NO:5 vs. SEQ ID NO: 2 http://www.ch.embnet.org/cgi-bin/LALIGN_form_parse] 

I 

Appendix H -J 

j§ lalign output ibr SEy iw rNu:^ vsr^j^v^^^^^* 4 



[ISREC-Server] Date: Mon Jan 14 19:59:06 MET 2002 



reset matrix file to /export/molbio/share/fasta2/blosum80.mat ./wwwtmp/lalign/.9148.1.seq : 764 aa 

ALIGN calculates a global alignment of two sequences 

version 2 . OuPlease cite: Myers and Miller, CABIOS (1989) 4:11-17 
SEQ ID NO:5 764 aa vs. 

SEQ ID NO: 2 763 aa 

scoring matrix: /export /molbio/share/f asta2/blosum80 . mat , gap penalties: -14/-4 
99.9% identity; Global alignment score: 4297 

10 20 30 40 50 60 

. /wwwt MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 



SEQ MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 
10 20 30 40 50 60 



70 80 90 100 110 120 

. /wwwt RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 



SEQ RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 
70 80 90 100 110 120 

130 140 150 160 170 180 

. /wwwt MKAVLNLKDGVNACI LPLLQI DRDSGNPQPLVNAQCTDDYYRGHSALHIAI EKRSLQCVK 



SEQ MKAVLNLKDGVNACI LPLLQI DRDSGNPQPLVNAQCTDDYYRGHSALHIAI EKRSLQCVK 
130 140 150 160 170 180 

190 200 210 220 230 240 

. /wwwt LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA 



SEQ LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA - 
190 200 210 220 230 240 

250 260 270 280 290 300 

. /wwwt TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 



SEQ TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 
250 260 270 280 290 300 

310 320 330 340 350 360 

. /wwwt AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 



SEQ AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 
310 320 330 340 350 360 

370 380 390 400 410 420 

. /wwwt HCKS PHRHRMVVLEPLNKLLQAKWDLLI PKFFLNFLCNLI YMFI FTAVAYHQPTLKKQAA 



SEQ HCKS PHRHRMVVLEPLNKLLQAKWDLLI PKFFLNFLCNLI YMFI FTAVAYHQPTLKK-AA 
370 380 390 400 410 

430 440 450 460 470 480 

. /wwwt PHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALL 
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sutput tor SEQ ID NO:5 vs. SEQ ID NO: 2 



http://wwwxh.ernbnet.org/cgi-bin/LALIGN_forrn_parser 



SEQ PHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALL 
420 430 440 450 460 470 

490 500 510 520 530 540 

./wwwt TVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHTGIYSVMIQKVILRDLLRFLL 



SEQ TVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHTGIYSVMIQKVILRDLLRFLL 
480 490 500 510 520 530 

550 560 570 580 590 600 

. /wwwt IYLVFLFGFAVALVSLSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLEL 



SEQ IYLVFLFGFAVALVSLSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLEL 
540 550 560 570 580 590 

610 620 630 640 650 660 

. /wwwt FKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATDSWSIW 



SEQ FKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATDSWSIW 
600 610 620 630 640 650 

670 680 690 700 710 720 

. /wwwt KLQKAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPT 



SEQ KLQKAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPT 
660 670 680 690 700 710 

730 740 . 750 760 

. /wwwt LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 



SEQ LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 
720 730 740 750 760 
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Jaligh output for SEQ ID NO:20 vs. SEQ ID NO:36 http://www.ch.embnet.org/cgi-bin/LALIGN_tbrm__parser 

j 

Appendix I j 
lalign output lor SEQ ID NO:20 vs. SEQ ID NO:36 



[ISREC-Server] Date: Tue Jan 29 16:25:59 MET 2002 



Vwwwtmp/lalign/.651.1.seq : 630 aa 

ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 4:11-17 
: SEQ ID NO: 20 630 aa vs. 

SEQ ID NO: 36 725 aa 

scoring matrix: BLOSUM50, gap penalties: -14/-4 
78.2% identity; Global alignment score: 3318 

10 20 30 40 50 60 

. /wwwt MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 



SEQ MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 
10 20 30 40 50 60 

70 80 90 100 110 120 

. /wwwt RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 



SEQ RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 
70 80 90 100 110 120 

130 140 150 160 170 180 

. /wwwt MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 



SEQ MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 
130 140 / 150 160 170 180 

s 

190 200 210 220 230 240 

. /wwwt LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA 



SEQ LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA 
190 200 210 220 230 240 

250 260 270 280 290 300 

. /wwwt TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 



SEQ TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 
250 260 270 280 290 300 

310 320 330 340 350 

. /wwwt AAKEGKIEIFRHILQR-EFSGLSH-LSRKFTEW-CYGPVRVSLYDLASVDSCEENSVLEI 

SEQ AAKEGKIGIFRHILASGKFSGLKPPFPRKFTEWWLMGPVRVSLYDLASVDSCEENSVLGI 
310 320 330 340 350 360 

360 370 380 390 400 410 

. /wwwt IAFHCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKK 



SEQ IAFHCKSPDRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKK 
370 380 390 400 410 420 

420 430 440 450 460 470 

. /wwwt QAAPHLKAEVGNSMLLTGHILILLGGI YLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQ 
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lalign output for SEQ ID NO:20 vs. SEQ ID NO:36 http://wwwxh.embnet.org/cgi-bin/LALIGN_tbrrn_parser 



SEQ QAAPHLKAEVGNSMLLTGHILILLGGIYLLVGQKWKFW IWI-FIDSYFEILFLFP 

430 440 450 460 470 

480 490 500 
. /wwwt ALLTVVSQV-LCF LAIEW YLPLLVSALVLGW 

SEQ GHRVVPAPACVCAGAGLAEPALLYTWLPAHRHLQCHDPEALVSLSQDWRPEAPTGPNATE 
480 490 500 510 520 530 

510 

. /wwwt LNLLYYTRG FQH 

SEQ SVQPMEGQEDEGNGAQYRGILEASLELFKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLT 
540 550 560 570 580 590 

520 530 540 550 

. /wwwt TGI YS VMIQKKAI SVLEMENGYWWCRKKQRAGVMLTVGTK 

SEQ YI LLLNMLIALMSETVNS VATDSWS IWKLQKAI SVLEMENGYWWCRKKQRAGVMLTVGTK 
600 610 620 630 640 650 

560 570 580 590 600 610 

. /wwwt PDGSPDERWCFRVEEVNWASWEQTLPTLCEDPSGAGVPRTLENPVLASPPKEDEDGASEE 



SEQ PDGSPDERWCFRVEEVNWASWEQTLPTLCEDPSGAGVPRTLENPVLASPPKEDEDGASEE 
660 670 680 690 700 710 

620 630 
. /wwwt NYVPVQLLQSN 



SEQ NYVPVQLLQSN 
720 
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output for SEQ ID NO:20 vs. SEQ ID NO:2 http://wwwxh.embnet.org/cgi-bin/LALIGN_torrn_parser 



Appendix J 

Ialign output for SEQ ID JNu:zu vs. oxsm-w ^ 



[ISREC-Server] Date: Tue Jan 29 16:26:37 MET 2002 



7wwwtmp/lalign/.1391.1.seq : 630 aa 

ALIGN calculates a global alignment of two sequences 

version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 4:11-17 
SEQ ID NO: 20 630 aa vs. 

SEQ ID NO: '2 763 aa 

scoring matrix: BLOSUM50, gap penalties: -14/-4 
82.3% identity; Global alignment score: 3710 

10 20 30 40 50 60 

./wwwt MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 



SEQ MTSPSSSPVFRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGEDRKFAPQIRVNLNY 
10 20 30 40 50 60 

70 80 90 100 110 120 

. /wwwt RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 



SEQ RKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLTDSEYTEGSTGKTCL 
70 80 90 100 110 ■ 120 

130 140 150 160 170 180 

. /wwwt MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 



SEQ MKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSALHIAIEKRSLQCVK 
130 140 150 160 170 180 

190 200 210 220 230 240 

. /wwwt LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA 



SEQ LLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVSYLLENPHQPASLQA 
190 200 210 220 230 240 

250 260 270 280 290 300 

. /wwwt TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 



SEQ TDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQLEDIRNLQDLTPLKL 
250 260 270 280 290 300 

310 320 330 340 350 360 

. /wwwt AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 



SEQ AAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASVDSCEENSVLEIIAF 
310 320 330 340 350 360 

370 380 390 400 410 420 

. /wwwt HCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFTAVAYHQPTLKKQAA 



SEQ HCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLI YMFIFTAVAYHQPTLKK-AA 
370 380 390 400 410 

430 440 450 460 470 480 

. /wwwt PHLKAEVGNSMLLTGHILILLGGI YLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALL 



1 of 2 



output for SEQ ID NO:20 vs. SEQ ID NO:2 http://wwwxh.embnet.org/cgi-bin/LALIGN_tbrrn_parser 

1 

SEQ PHLKAEVGNSMLLTGHILILLGGIYLLVGQLWYFWRRHVFIWISFIDSYFEILFLFQALL 
420 430 440 450 460 470 

490 500 510 520 
. /wwwt TVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHTGIYSVMIQK 



SEQ TVVSQVLC FLAI EWYLPLLVSALVLGWLNLLYYTRGFQHTGI YS VMI QPCVI LRDLLRFLL 
480 490 500 510 520 530 



. / wwwt 

SEQ IYLVFLFG FAVALVSLSQEAWRPEAPTGPNATESVQPMEGQEDEGNGAQYRGILEASLEL 
540 550 560 570 580 590 



. /wwwt 

SEQ FKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNMLIALMSETVNSVATDSWSIW 
600 610 620 630 640 650 

530 540 550 560 570 580 

. /wwwt KAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPT 



SEQ KLQKAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDERWCFRVEEVNWASWEQTLPT 
660 670 680 690 700 710 

590 600 610 620 630 

. /wwwt LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 



SEQ LCEDPSGAGVPRTLENPVLASPPKEDEDGASEENYVPVQLLQSN 
720 730 740 750 760 
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